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ABSTRACT 

This study survevctJ strategies of se- 
quencing primer selection and evuluuietl 
primer performance in automated DNA se- 
quencing. We asked participants to relate 
their preferred primer design strategies to 
identify primer churacleristics that are con- 
sidered most important in sequencing 
primer design. The participants preferred 
primers of 18-24 nucleotides (nt). 
39%-5S% G+C, a melting temperature 
(TJ of 53''-6yC with a 1-2 nl 3' C,C 
clump, huirpin stems of less than 2-3 bp. 
homopolymeric runs of than 4-5 nt, 
primer dimers of less than 3-4 bp and sec- 
ondary priming sites of less than 3-4 bp. We 
provided a 300-bp test sequence and asked 
participants to submit sequences of 1-3 op- 
timal sequencing primers. Submitted 



primers ranged from 17-24 nt and largely 
conformed to the preferred parameters. 
Submitted primers vnere distributed across 
the test sequence, although some sites were 
disfavored. Surprisingly, approximately 
45% of the primers were selected "munuul- 
ly ", more than by any software package. 
Each of 69 submitted and 95 control 
primers, distributed at 3 bp intervals across 
the test sequence, were synthesized, purified 
and tested using a Model 377 Pkism™ DNA 
Sequencer with dichlororhadamine dye ter- 
minator reagents (dRhodamine dye termi- 
nators). Approximately half of the control 
primers were also tested using rhodamine 
dye terminator reagents ("old" rhodamine 
dye terminators). The results indicated that 
primer physico-chemical characteristics 
thought to have a strong impact on sequenc- 
ing performance had surprisingly little ef- 
fect. Thus, primers with high or low percent 
G-^-C or T^. strong secondary priming 
scores or long 3' homopolymeric stretches 
yielded excellent ■iequences with the dRho- 
damine dye terminator reagents, although 
these characteristics had a stronger effect 
when the old rhodamine reagents were 
used. The old rhodamine reagents gave se- 
quences with a similar average read length, 
but the number of errors and ambiguities or 
"N's" was con.sistently higher. .'Moreover, 
the effects of the primer physico-chemical 
characteristics were also more evident with 
the old rhodamine dyes. We conclude thut 
under optimal sequencing conditions with 
highly pure template and primer, many of 
the commonly applied primer design para- 
meters are dispensable, particularly when 
using one of the new generation of sequenc- 
ing reagents such as the dichlororhodamine 
dye terminators 



INTRODUCTION 

The Nucleic Acids Research Com- 
miltee (NARC) of the Association of 
Biomolecular Resource Facilities 
(ABRF) evaluates the procedures and 
performance of research facilities that 
perform nucleic acid synthesis and se- 
quencing in academic, industrial and 
other institutions (5,6). These studies 
provide a measure of the industry state- 
of-the-art, as well as the level of perfor- 
mance, that can be expected from such 
facilities. Previous projects have exam- 
ined general operations of DNA core 
laboratories (15), the accuracy of auto- 
tnated DNA sequencing in these facili- 
ties (12), and the performance of unpu- 
rified synthetic oligonucleotides as 
primers for automated DNA sequence 
analysis (14), The DNA Sequencing 
Research Committee of the ABRF has 
performed studies evaluating the ability 
of DNA core facilities to sequence dif- 
ficult templates (2) and the perfor- 
mance of various sequencing strategies 
and chemistries (I), Herein, we have 
studied primer selection strategies used 
by DNA sequencing facilities and em- 
pirically examined the quality of the se- 
quence data provided by primers select- 
ed by these facilities. 

Despite a long-standing conviction 
within the DNA sequencing communi- 
ty that careful primer design is essential 
to ensure high quality data (c.f, 
I 1,14,17,18), there is a paucity of em- 
pirical studies supporting this view. A 
variety of reasonable "rijies of thumb" 
(13) suggesting optimal ranges for 
primer length, percent G+C, melting 
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Tlble I. Predicted Optimal and Observed Physico-Chemical Characteristics of Primers for Automated DNA Sequencing or PCR 



Sequencing Primers* PCR Primers'^ Submitted Primers'^ 

Optimum<< Range* Optimum*' Ranged Mean' Ranges 



Length 


18-24 nt 


15-^0 nt 


18-29 nt 


15-10 nt 


19.7 nt 


16-24 nt 




52°-66=C 


40=-95°C 


so^-egx 


40°-95=C 


66°C 


48°-9rC 


Percent G+C of primer 


40%-61% 


30%-70% 


40%-61% 


30%-70% 


53% 


29%-82% 


3' GC clampi 


1-2 m 


0-5 nl 


1-2 nt 


0-2 nt 


1.1 nt 


O^nt 


Primer dimeri 


3-4 bp 


0-7 bp 


3-4 bp 


0-7 bp 


-5.4 


-1.6-16.5 


Hairpin stems'* 


2-3 bp 


0-6 bp 


2-3 bp 


0-6 bp 


1.1 bp 


0-5 bp 


Homopolymer runs' 


4-5 nt 


3-10 nt 


4-5 nt 


3-1 Ont 


3.0 nt 


2-Snt 


Secondary priming"" 


3--4 


0-10 bp 


3-4 bp 


0-10 bp 


49" 


0-266" 



aParameter values preferred by participants for primers to be used in sequencing. 

"Parameter values preferred for primers to be used in PCR experiments. 

t^Parameter values of the primers submitted by participants. 

<^0p1imum values for each parameter as selected by the participants, 

ejhe range of acceptable values selected by the participants. 

'The mean values for each parameter calculated from the submitted primers. 

9The range of values for each parameter calculated from the submitted primers. 

^Values submitted by participants for primers to be used for sequencing or PCR and nearest neighbor of primers submit- 
ted by participants. 

iThe 3' clamp refers to the number of G or C bases at the 3' terminus of the primer. 

iLargest number of contiguous complementary bases permitting dimerization of primer or the stability of stfongesl potential 
primer dimer (kcal) as measured by Oligo 5.0. 

^Largest number of bases capable of forming a hairpin stem in the primer. 

'Number of bases in longest homolopolymeric run in the primer. 

""Number of bases permitted in most stable secondary priming site. 

"The priming efficiency as calculated by Oligo 5.0 for test sequence only (submitted primers). 



temperature (T,^) and composilion have 
led to the development of primer design 
software packages that identify primers 
conforming to these criteria (4,16,18). 
However, Ihc success of these strategies 
in selection of high-quality sequencing 
piitners, to our knowledge, has not been 
extensively examined. In many se- 
quencing projects, primer design and 
synthesis represent the most significant 
costs and consume the bulk of effort 
and time. Therefore, it is extremely im- 
portant that these primers yield high- 
quality sequence data. 

To examine this issue, an c-mail sur- 
vey containing general questions about 
laborator>' functions and specific ques- 
tions concerning important characteris- 
tics for primer selection was distrib- 
uted. A 300-bp test sequence was 
provided for which participants were 
asked to design sequencing primers. 



The submitted primers and a set of con- 
trol primers spanning (he 300-bp tesi 
sequence at 3-bp intervals were synthe- 
sized and used to sequence the test 
template on a Model 377 Prism™ 
Automated DNA Sequencer (PE Bio- 
systcms, Foster City, CA, USA). Due to 
the results of this study, current primer 
design rules of thumb may be stream- 
lined to facilitate more efficient primer 
selection for sequencing projects. 



MATERIALS AND METHODS 

The Survey 

In December of 1 996 and January of 
1997, members of (he ABRF were re- 
quested via e-mail distributions, the 
ABRF Electronic Bulletin Board and a 
standard mailing, to participate in a 



study of DNA sequencing primer de- 
sign. The distribution provided a 300-bp 
"test sequence" (Figure 1 ) that was se- 
lected and previously shown to contain 
no segments that affect sequence ladder 
extension (data not shown). Participants 
were asked to use the prevailing tech- 
nology in their laboratories to design 
and submit the sequences of up to three 
sequencing primers in the forward di- 
rection. Participants were requested to 
select the best primers irrespective of 
their positions on (he test sequence. All 
responses were anonymously screened 
and coded by Virginia Commonwealth 
University (VCU) Health Sciences 
Computer Center personnel. Identifying 
data were removed from each document 
before being forwarded to the Commit- 
tee for analysis. Several rcspondees 
submitted uninterpretable data. Where 
possible, these respondecs were identi- 
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fied by Computer Center personnel, and 
new information was requested. Every 
elTort was made to maintain the integri- 
ty of the data and the anonymity of the 
participants. 

Control and Submitted Primers 

A panel of 95 primers were synthe- 
sized as conlrols. These 18 nucleotide 
(nl) primers spanned the 300-bp lest se- 
quence with their 5' termini located at 
3-bp intervals (i.e., the first began at bp 
1 of the lest sequence, the second began 
at bp 4, etc.). All primers were ana- 
lyzed with the OligoTM program 5.0 
(NBl/Genovus, Plymouth, MN, USA) 
(16) and the number of bases, percent 
G-l-C, Tn, (nearest neighbor method de- 
scribed in Reference 9), internal struc- 
ture, secondary priming sites, etc., were 
recorded. 

The 69 submitted and 95 control 
primers were synthesized on a Model 
3948 DNA Synthesis and Purification 
System (PE Biosyslems) at three differ- 
ent sites. Synthesis was performed at 
the 40 nmol scale using PE Biosystems 
reagents and the standard synthesis, 
cleavage and purification cycles. An 
aliquot of each sample was analyzed by 
polyacrylamide gel electrophoresis 
(PAGE) and by matrix-assisted laser 
desorption ionization time of flight 
mass spectrometry (MALDI-TOF/ 
MS). PAGE was performed using pre- 
cast minigels (Novex, San Diego, CA, 
USA) following the protocol specified 
by the manufacturer. Mass analysis was 
performed on a 200-pmol sample 
mixed with 10 mg/mL 3-hydroxypicol- 
inic acid matrix (in 50% acctonitrilc, 
0.1% trifluoracetic acid) on a Voyager 
RP MALDI-TOF/MS inslniment (Per- 
Scptive Biosystems, Framingham, MA, 
USA). Criteria for passing quality con- 
trol were: (/) a single major band on the 
gel; (;7) appropriate mobility on the gel 



compared to molecular weight stan- 
dards; (;■(/■) a synthesis yield >0.5 ODU 
(absorption of sample in I mL at 260 
nm with 1 cm palhlength set up on the 
3948 Synthesizer); and (iv) a single 
peak of the expected mass by MALDI- 
TOF/MS- Any oligonucleotides that 
failed to meet these criteria were resyn- 
thesized. 

Test Template and Preparation 

The plasmid template was preselect- 
ed to contain a test sequence lacking 
obstacles to sequence extension (data 
not shown) and purified by double 
banding in CsCI-ethidium bromide iso- 
pycnic density gradients (10). 

DNA Sequence Analysis 

Each of the purified and quality-test- 
ed oligonucleotides were used as a 
primer for DNA sequence analysis us- 
ing dRhodamine Terminator Cycle Se- 
quencing Ready Reaction Kit with Am- 
pHTaq® FS reagents (PE Biosystems) 
under standard reaction conditions as 
described by the manufacturer at 20 
total vol containing 300 ng of template 
and 5 pmol of primer. Sequencing reac- 
tions were run for 25 cycles of 10 s at 
96''C, 5 s at 50°C and 4 min at 60°C in 
a GeneAmp® PCR System 9700 DNA 
Thermal Cycler (PE Biosystems), Re- 
actions were analyzed on Model 377 
Prism Automated DNA Sequencers us- 
ing 5% Long Ranger™ (PMC BioProd- 
ucts, Rockland, ME, USA) gels with 
Tds-Borate-EDTA buffer (TBE; pH 
8,3). .Approximately 30% of the control 
primers were rerun using Ihe same 
dRhodamine dye terminalor/Taq FS kit 
to ensure reproducibility, and approxi- 
mately 50% of the control primers were 
rerun using the old rhodaminc dye tcr- 
minator/Taq FS kit (PE Biosystems). 
The dRhodamine or dichlororhodaminc 



reagents are modifications of the origi- 
nal old rhodamine dyes that have nar- 
rower emission spectra, less spectral 
overlap and more even peak heights 
than the earlier dyes (19). Each 5% 
polyacrylamide sequencing gel was 
cleclrophoresed for approximately 8 h, 
and the data was directly exported for 
analysis using Sequencher'^" (Gene 
Codes, Ann Arbor, Ml, USA). Using 
this software, the raw data was trimmed 
to remove 5' and 3' ambiguous se- 
quences, so that the first and last 25 nt 
contained no ambiguities. Each result 
was aligned with the known sequence, 
and we recorded: (/) the number of nt 
between the 3' end of the primer and 
the beginning of Ihe trimmed data; («) 
the number of nt in each sequence read; 
and (Hi) the number of errors, including 
miscalls, insertions, deletions and am- 
biguities in each lOO-bp window 
through the end of the sequence read. 



RESULTS 
Participant Profiles 

Thirty-nine laboratories participated 
in the study, submitting 69 primer se- 
quences. Of the participants, approxi- 
mately 77% provide synthesis services 
and approximately 85% provide se- 
quencing services, but only approxi- 
mately 50% provide primer design or 
walking services, and less than one 
third offer template preparation. Labo- 
ratories performing these services aver- 
aged annually approximately 3700 syn- 
theses, 7000 sequence runs, 425 primer 
designs, 424 template purifications and 
less than 200 primer walking projects. 
Charges for services averaged S 1 . 1 7/nt 
for 40-50 nmol and S2.19/nl for 
15()-200 nmol syntheses of a 25-nl un- 
purified primer and S24/reaction for 
DNA sequence analysis (S9 for gel 
only). The average price for a primer 
walking sequencing project of a 2-kbp 
insert in a plasmid vector was approxi- 
mately S500 or S0.25/bp. Charges for 
"external users", where available, aver- 
aged approximately 17%-50% higher 
than charges for institutional users. 
DNA synthesis and sequencing instru- 
mentation was dominated by PE 
Biosystems; i.e., 77% of synthesizers 
and 92% of sequencers, but Beckman 



1 ATCCACACGG CGCTTTGACT CCCCTTCTGA AAAAAAAGAA CACGTTGATT 

51 ccccrrrfGA tttcccacga atgcgaacca caattcaccc acagccaacg 

101 CCACGGACyu3 ACGATAAGGT AAAATACACA CAAGTCGTOG AATCACAAAA 

151 GGACACGCAC AGCACGCAAA TAAACATTGC ACACGGCATG ACAnTTGCa 

201 ACCCACACAC AATCTGTACA CTACAATTAT ATGAAAATCC CTCCCCCATT 

251 CCGCGGGTGC GCCXMTAAAGG GCCAAAAACC CAAATGATCC ACITTATTAr 



Figure 1 , Test sequence. Shown is the 300-bp sequence distributed to pan Icipants for design uf forwarO 
sequencing primers. Participanis were asked to design primers in the Torward direction. 
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Inslruments, PerSeptive Biosystems 
and Ll-COR were also represented. 
Only approximately 1 5% of the partici- 
pants maintained robotic systems for 
template preparation or sequencing. Fi- 



nally, when asked if commercial sup- 
pliers had an impact on services provid- 
ed by the facilities, almost 80% of the 
laboratories responded affirmatively. 
Approximately 38% had dropped 




Figure 2. Primer characteristics vs. sequencing perrormancc. The perrornijnce of sequencing 
primers, sorted according to various physico-chemiciil characteristics, was examined. The number ofer- 
rors over the full-lengih read of each primer, determined as described in the Materials and .Vteihods, was 
plotted against the resuhs of each primer sorted according to the relevant parameter. (A) Primer length 
vs. errors. The primers were sorted according to their lengths in ascending order, and secondarily, ac- 
cording to their positions on the test sequence. The number of errors over the Aill-lenglh read is plotted. 
Noie that since all of ihe primers tested with the old rhodamine reagents were controls, and therefore 1 8- 
nt long, the daia for old rhodamines are not included in this analysis. The X-asis distributes the primers 
(rum 16- 24 nl in length; the y-axis indicates the number of errors over the full-lengih read of that priiin-r 
(U) Position of the primer on the test sequence, The primers were sorted according to the position of their 



5' nl 0 
plotted (y. 

(9r-37=C 



;e(bp I 2S3 oi 



I. (C) T„, of the 1 



r. The primers were sorted in descending order by thetr T,„'s 
jf errors in their full-length reads were plotted (y-axis). (D) The 
s were sorted in deseending order (82%-17% on x-axis) by their 
: in their full-length reads were plotted (y-axis). (F.) Homopoly- 



percenl G^C, and the number oferri 
nieric stretches in the primer. The primers were soncd accoi 

in the longest homopolymerie sireteh, and secondarily aceording to the number of ni in the sc 
longest homopolymerie stretch. The number oferrors over the full-length read were plotted (y-axis 
Secondary prtming efficiency of the primers. Rach primer was examined for secondary primmg si 
the vector and across the test sequence using Oligo 5.0. The priming eftlcieney score assigned by Oligo 
5 .0 was determined. The primers were sorted first according to the score of the strongest secondary prun- 
ing site in the vector (304- 42 on x-axis) and secondarily according to the strongest secondary priming 
score in the test sequence. The number oferrors across the ftill-length reads were plotted (y-axis). 
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Table Z. Average Primer Performance! of Sjbmined and Conirol Primers Using dRhodamine or Old Rhodamine Dye Terminator Sequencing Kits* 



5' nt trimmed^ 
read length m nt<^ 
ambiguities {N's)e 
errors per first 700 nt' 



Submitted Primers 
dRhodamines'> 

3.3(2, 1-14) 

760 (770, 674-802) 
2.2(1,0-16) 
2.9(2, 0-14) 



Control Primers 
dRhodamines') 



Control Repetition Control Primers 
dRhodamines" old kit dyes'} 



4.6(4,1-19) 4(3,0-16) 4.5(4,0-16) 

772 (776, 683-799) 798 (803. 733-667) 752 (764. 654-803) 

1.6(1,0-32) 4.4(4.1-17} 8.7(7,0-58) 

2.9(2,0-46) 5.4 (5,3-16) 16.5(12,3-92) 



^Old rliodamine l<its refer to the original ABI Prism Dye Terminator Cycle Sequencing Kits, dRhodamine kits refer to the new 
ABI Prism dRhodamine Terminator Cycle Sequencing Ready Reaction Kits containing the dichlororhodamine dye termina- 
tors. 

bThe data is shown as the average with the (median, range) in parentheses, 

cThe number of nt between the 3' end of the primer and the beginning of the trimmed sequence. 

''The length of read In nt after trimming using Sequencher (see Materials and Methods). 

^The number of ambiguous calls (N's) In the fuil-length sequence read. 

The number of errors in the first 700 nt of the sequence. 



prices and approximately 25% indicat- 
ed a decreased number of synthesis re- 
quests in response to commercial sup- 
pliers. About 23% of the laboratories 
purchased or planned to purchase large- 
scale synthesis instruments to match 
the new competition. However, only 
approximately 5% had discontinued 
synthesis services. 

Software or Manual Selection of 
Primers 

Thirty-one laboratories described 
their practices for (he design of se- 
quencing and PCR primers. We were 
surprised that, in spite of the availabili- 
ty of excellent commercial primer de- 
sign software programs, over 50% 
(17/31) of the reporting laboratories 
preferred to select primers manually. 
Of the 25 laboratories reported that se- 
lection and design of PCR primers, 
again over 50% (14/25) tended not to 
use software. Of those laboratories thai 
rely on software for primer selection, 
over 60% rely on various versions of 
the Oligo software for sequence primer 
design, and over 70% use Ihc Oligo 
software for PCR primer design. 

Idealized Primer Physico-Chemical 
Characteristics 

Participants were aslced to provide 
their preferences for the physico-chem- 



ical characteristics of the ideal sequenc- 
ing and PCR primers. This analysis 
provided few surprises (Table I ), in that 
the idealized primer characteristics of 
most respondents closely parallel those 
of some common rules of thumb for 
primer design. Thus, sequencing pri- 
mers should be: 18-24 nl in length, T^ 
= 52°-66°C, 4 00.4-60% Gi C, with 1-2 
bp 3' GC clamps (G or C nt at 3' termi- 
nus of the primer) and very little inter- 
nal structure, homopolymeric sequence 
or potential for secondary priming. A 
similar setof characleristics was report- 
ed for PCR primers, although the opti- 
mal acceptable length, Tu, and percent 
G-i-C were slightly greater than for se- 
quencing primers. However, it was un- 
expected that some participants accept 
primers of up to 40 bases, with T|^'s as 
high as 95°C for both sequencing and 
PCR primers. When asked to rate the 
relative importance of each characteris- 
tic, participanis rated the existence of 
secondary priming sites and percent 
G-i-C as most important, with Tj,,, pres- 
ence of a 3' GC clamp, the potential to 
form primer dimcrs, primer length, the 
presence of internal structure or ihc 
presence of homopolymeric runs, in or- 
der of decreasing importance for se- 
quencing primer design. Relative im- 
portance of these characteristics was 
similar for PCR primers, except thai the 
potential for formation of primer 
dimers was rated almost as important as 



the existence of potential secondary 
priming sites. 

Characteristics of tiie Submitted 
Primer Sequences 

The 39 participating laboratories 
submitted 69 primer sequences ranging 
in length from 16-24 nt with an aver- 
age of approximately 20 nt. The 
primers were distributed fairiy evenly 
across the 30O-bp test sequence, al- 
though some segments were obviously 
purposefully excluded; e.g., any primer 
beginning between bp 2 1 and 30 of the 
test sequence would bear an 8-nt 
poly(A) segment fnol shown). Not sur- 
prisingly, most of the submitted pri- 
mers conformed closely to the ideal- 
ized physico-chemical characleristics 
for sequencing primers (Table 1). Al- 
though some deviated from the ideal- 
ized characteristics, all conformed to 
the more broad ranges defined by par- 
ligipanis. Thus, the participants select- 
ed primers according to their idealized 
criteria, even though almost half of the 
submitted primers were selected manu- 
ally without the aid of computer soii- 
warc. Interestingly, some of the submit- 
ted primers had very high or low T,^ 
(91° or 48°C) or percent G-hC (82% or 
29%), indicating that the participanis 
do in fact permit a significant variation 
in these characteristics. Moreover, 
some primers exhibited relatively high 
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secondary priming elTiciency numbers 
(a score calculated by Oligo 5.0, 
whereby a priming efficiency of 160 
can prime in sequencing reactions) 
even within the test sequence (Table 1 ). 

Sequencing Performance of 
the Primers 

The results of the sequence analysis 
as described in the Materials and Meth- 
ods were unexpected (Table 2); i.e., we 
expected a wide range in the perfor- 
manee of the submitted and control 
primers. Instead, all of the submitted 
primers functioned extremely well, the 
poorest performer still yielding <2% 
errors over a >700-nl sequence. The av- 
erage number of errors and ambiguities 
over the 700-nt window was 2.9, and 
the average read length was 760 nt. 
Similarly, almost all of the control 
primers functioned very well, with an 
average of 2.9 errors or ambiguities 
over the 700-nt window and an average 
read length of 772 bases. Only one 
primer (No. 8) failed to give any rea- 
sonable sequence, and even this 
oligonucleotide performed adequately 
on the repeat; i.e., 13 errors and ambi- 
guities over a 700-nt read. The average 
read lengths of the sequences from the 
submitted and control primers were 
760 and 772, with ranges of 674-802 
and 683-799 bases, respectively. The 
average number of nucleotides trimmed 
from the 5' ends of these sequences was 
3.3 and 4.6 nt, respectively. Many se- 
quences began on the first nucleotide 
after the primers, but up to 14 or 19 nt 
were trimmed from the 5' ends of the 
submitted and control primers. As ex- 
pected from previous studies (1,2,12, 
14,18), errors and ambiguities were 
clustered in the firsl and last 100-bp 
windows of the sequences (data not 
shown). Approximately 30% of the 
control primers were used in repeat re- 
actions with very similar results (Table 
2, and data not shown). 

The control primers used in se- 
quence reactions using the old rho- 
damine dye terminator reagents gave 
slightly poorer overall results. The read 
lengths of these reactions did not differ 
greatly from the read lengths of the 
dRhodamine reactions (average 752 nt 
vs. 772 bases, Table 2). However, there 
were more N's (average 8.7 vs. 1 .6) and 



errors and ambiguities (average 16.5 
vs. 2.9) in the data generated with old 
rhodamine dye terminators than in the 
dRhodamine-gencrated data (Table 2). 

Poor Primers Yield Good 
Sequence Results 

The uniform high-quality sequence 
dala from the control primers using the 
dRhodamine reagents was not expect- 
ed. Only primer No. 8 failed calaslroph- 
ically (i.e., gave no valid sequence 
data), and primer No. 9 yielded >5% er- 
rors over a 700-nt window. However, 
both of these primers provided better 
data when used in a repeal sequencing 
reaction (Table 3). Moreover, there was 
no obvious relationship between the 
number of errors in a sequence and the 
length, position, T^,, percent G+C, sec- 
ondary priming potential, length of ho- 
mopolymeric stretches (Figure 2, pan- 
els A-F) or the potential for forming 
primer dimers (data not shown) of the 
primers. On the contrary, many primers 
exhibiting suboptimal physico-chemi- 
cal characteristics yielded high-quality 
sequence. For example, although the 
one primer, No. 8, that failed castas- 
trophically has an 8-nt poly(dA) stretch 
near ils 3' terminus, other primers, e.g., 
Nos, 7 and 10, contain a similar se- 
quence, but still generated very re- 
spectable data (Table 3). Primer No. 9, 
which also contains (he same poly(dA) 
stretch, also yielded relatively poor data 
(46 errors in 700 bases), although the 
error rate was <3% over the first 500-nl 
window (data not shown). Examination 
of the sequences of these primers sug- 
gests that homopolymeric stretches of 
dA's may have a negative impact on se- 
quence quality, but that these stretches 
must be quite long, and near to the 3' 
terminus to exert their effects. Other 
primers with extensive homolopoly- 
merie stretches yielded good results 
(Table 3). Finally, when the length of 
the longest homopolymeric stretch in 
each of the primers was plotted again.sl 
the number of errors in the sequence re- 
action (Figure 2E), no general trend 
was observed when reactions were per- 
formed with dRhodamine reagents. To- 
gether, these observations suggest that 
under optimal conditions using dRho- 
damine reagents, long homopolymeric 
stretches do not necessarily negatively 
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Tible 3. Primeri wtlh Physlco-Cbemical Characteristtct that Were Not Ex|Kcied to Provide Good Sequencinu Results 



Primerff Sequence 



Errors' 



'old'Rhodainigei 



UJ*C 



rttKUrn icul read per 700 m lotri read 



per TOO m imI t—d 



l/TIl 

T/717 



f/IOJ 

11/T>0 



u/ett 

<3/SM 

13 /TU 



*1*C » 

•rc »i 



«/r7i 

U/771 



17/7M 
ll/l«7 

»/7aa 



U/703 
13/111 
ia/718 




I I III! Hill! 



lErrors and ambiguities per 700 nt read, or per total read length. For the dRhodamines, both the initial results and the re- 
sults of the repeat analysis are shown. 

^Length of the longest and second longest homopolymeric stretches in the primer. 

3X/Y where X is the priming efficiency number generated by Oligo version 5.0, for potential secondary priming in the vector, 
and Y is the value for a perfect match with that primer. In general, the higher the score, the higher the stability and quality 
of the priming at that site. A score of >200 will often give false priming results according to the Oligo 5.0 manual. 

nd, not determined 



eflccl sequence results. 

and percent G+C also showed 
less impact than expected on the dRho- 
daminc reactions (Figure 2, C and D 
and Tabic 3). Thus, primers with low 
Tn, and percent G+C; e.g., 37^0/28% 
and 44°C/28% for primer Nos. 72 and 
39, and others with high Tn,/percenl 
G+C; e.g., 75°C/72% and 83''C/78% 
for control primer Nos. 80 and 84 and 
91°C/82% for submitted primer No. 



4321b, each provided sequence at well 
under 1% error rate (Table 3). Sec- 
ondary priming potential also exhibited 
less of an obstacle for dRJiodamine se- 
quencing than was anticipated. Primers 
with very high secondary priming 
scores at sites within the template pro- 
vided excellent sequence data. Control 
primer Nos. 82 and 83, and submitted 
primer Nos. lOAb and 49b, had high 
secondary priming scores, and in each 



case the homology encompassed the 3' 
end of the primer (Tabic 3), Nonethe- 
less, these primers yielded very high- 
quality results. 

Finally, wc compared the sequenc- 
ing results generated with primers se- 
lected by software to the results of 
manually selected primers and to the 
arbitrarily selected control primers. 
There was no observable difference in 
the results obtained; i.e., the average 
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number of errors over a 7(X)-n1 se- 
quencing window was 2.9, 3.1 and 3.3 
for all submitted primers, for submitted 
primers selected using software and for 
the control primers, respectively. More- 
over, no clear trends were observed 
when comparing the physico-chemical 
characteristics of the soflware-designed 
primers to the manually designed or the 
eonlrol primers (data not shown). 

Old Rhodamines vs. dRhodamines 

The old rhodamine dye reactions 
were more strongly impacted by primer 
physico-chemical characteristics than 
the dRhodaminc dye reactions. As de- 
scribed above, the old rhodamine dye 
reactions were consistently poorer than 
the dRhodamine reactions (Table 2). 
Moreover, the former reactions seemed 
more sensitive to primer characteristics. 
Thus, reactions with higher error rates 
tended to be those with lower Tn,, lower 
percent G+C and longer homopolymer- 
ic stretches (Figure 2, panels C-E). As 
with the dRhodamines, the old rho- 
damine reactions exhibited no discem- 
able trend in relation to secondary 
priming potential (Figure 2F), or primer 
secondary structure (data not shown). 



DISCUSSION 

In this study, we examined the per- 
ceptions and strategics of participant 
core facility personnel for the design 
and selection of DNA sequencing and 
PCR primers, The results may reveal 
some unexpected misconceptions in the 
requirements for good sequencing 
primer design. Thus, most laboratories 
espouse primer design dogma suggest- 
ing that a sequencing primer should be 
18-24 nt in length with a T^, = 
52°-66°C, 40%-60% G+C, 1-2-bp 3' 
GC clamps and little internal structure, 
homopolymeric sequence or potential 
for secondary priming. The percent 
G+C and secondary priming potential 
were rated as the most important char- 
acteristics, but Tjjj, GC clamps, primer 
secondary and tertiary structure, length 
and the presence of homopolymeric 
njns were also thought to be relevant. 
Moreover, previous studies have shown 
that secondary structure of the template 
in the primer target regions can effect 



priming efficiency (7,8), 

With regard to primer design, we 
were surprised to find that the majority 
of investigators design most of their se- 
quencing primers manually. Thus, in 
spite of the perceived importance of 
primer characteristics and the wide- 
spread availability of primer design 
software, approximately 55% of the 
participants in this study elected to de- 
sign primers manually. We believe that 
this observation is the result of a com- 
bination of the facts that primer design 
software is not yet sufficiently "user- 
friendly" and that the primer physico- 
chemical characteristics are not as im- 
portant as commonly believed. It would 
seem that the authors of primer design 
software could take note of this result 
and streamline and enhance their prod- 
ucts so that they are more convenient, 
beneficial and practical to use. Clearly, 
however, there are many circumstances 
(e.g., templates with GC-rich regions or 
with unusual sequences or structures, 
genomic sequencing or the sequencing 
of very large templates with many re- 
peat regions) in which it would be ad- 
visable to use appropriate primer de- 
sign software to automate and expedite 
primer design and synthesis. 

The results of the empirical se- 
quencing analysis were surprising in 
that nearly all of the primers yielded 
data of extremely high quality. Excel- 
lent data was obtained in spite of wide- 
ly varying primer T^ and percent G+C, 
the presence of ncariy 50% of the 
primer as homopolymcr and the pres- 
ence of very strong secondary priming 
sites, characteristics considered by 
most respondents lo be the critical for 
sequencing primers. This study con- 
firms earlier observations (3) that there 
is a broad tolerance in these and other 
characteristics of sequencing primers. 
Other factors, such as template purity 
or quality or technical expertise, possi- 
bly play greater roles. In this study, the 
plasmid template was selected for ab- 
sence of sequence extension obstacles 
and purified by double banding in 
CsCl-ethidium bromide isopycnic den- 
sity gradients. Therefore, this template 
was extremely pure and optimal for se- 
quencing. The primers were similarly 
highly purified. The reactions were per- 
formed in a single high-throughput 
sequencing facility under tightly 
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controlled conditions. DifTerent results 
may be obtained using less carefully 
purified DNA templates with unusual 
sequences or structures or in less rigor- 
ously controlled sequencing operations. 
Finally, the data generated using old 
rhodamine reagents was, in general, 
poorer in quality than that generated us- 
ing the dRhodamine reagents. More- 
over, the expected trends in comparing 
sequence performance and primer char- 
acteristics were more prominently ob- 
served using the old rhodamine 
reagents. We believe it possible, if not 
likely, that the greater uniformity in 
peak height generated by the newer se- 
quencing reagents (19) is at least par- 
tially responsible for this difference. 
Thus, background peaks are less likely 
to interfere with base calling, lessening 
the overall effects of potential problems 
like secondary priming and stutter. 



ACKNOWLEDGMENTS 

We would like to thank the ABRP 
for sponsoring this study and its mem- 
bers for their participation. Wc also 
thank John Fritz of the VCU Health 
Sciences Computing Center for his as- 
sistance in electronic distribution of the 
study and anonymous receipt of the re- 
sponses. The ABRF is a nonprofit orga- 
nization devoted to promoting excel- 
lence in biotechnology core facilities. 
Information about the ABRF and its 
programs can be obtained from the 
ABRF Business OfTice (Tel: 301-571- 
8300) or the ABRF Web site (http:// 
www,abrf.org). 



REFERENCES 

1. Adams, P.S., M.K. Dolejsi., C. Grills, S. 
Hardin, D.L. Mc.Minimy, P. Morrison, J. 
Rush, S. Gorret at. 1 998. An analysis of tech- 
niques used lo improve tlie accuracy of auto- 
mated UNA sequencing of a GC-rich template: 
resulu from the 2nd ABRF DNA T lucncc 
Research Group Study. 1. Biomol. lecliniq. 
9:9-IS. 

2. Adim$, P.S., .M.K. DolcJsl,S. Hardin, S. Mls- 
che, B. Nanthakumir, H. Ricthman, J. Rush 
■nd P. Morrisson. 1996. DNA sequencing ofa 
moderately dilTlcult lemplaier evaluation ufihe 
rcsulU from a Thermui lliermopMlus unknown 
LesL sample. DioTechniques J/:678. 

3. Grayburn, W.S. and T.l„ Sims. 199g. An- 
chored oligo(dT) primers for automated dye 
terminator DNA sequencing. BioTechniqucs 
.'J;340-346. 

4. Hyndman, D., A. Cooper, S. Pruzinshky, D, 
Coad and ,VI. Mltsuhashi. 1996. Software to 
delermrnc optimal oligonucleotide sequences 
based on hybridization simulation data. 
BioTechniques 20: 1 090- 1 097. 

S.lvanttich, K., L. Bibbs, R.L. Niece, N.D. 
Denslow, C.W. Nacve, M. Ruhde and L.H. 
Ericsson. 1997. CoiTimercral biotech iiistru- 
meniaiioH survey on performance, support, ser- 
vice. Genetic Engineering News 17:17. 

6.lvanetich, K., R.I^ Niece, M. Rohde, E. 
Fowlerand T.K. Hayes. 1993. Biotechnology 
core facilities', trends and update. FASEB J. 
7:110*^-1114. 

7.Kaczoniwski,T. and W.Snbalskl. 1995. Co- 
operativily of hexamer ligation. Gene / 79: 1 89- 
193. 

5. Kaczorowski, T. and W.SzybalskL 1998. Ge- 
nomic DNA sequencing by SPEL-6 primer 
walking using huxamer ligation. Gene 22J:83- 
91, 

9. Larhe, R. 1986. Synlhctie oligonucleotide 
probes deduced from amino acid sequence 
data: iheoreiical and practical considerations. J. 
Mol. Biol. /SJ-.1-12. 

10. Maniaris, T., £.F. Friisch and J. Sambrook. 
1982. Molecular Cloning; A Lahoraiory Man- 
ual. CSH Laboratory Press, Cold Spring Har- 
bor, NY. 



1 1 .McCraw, R.A., E.K. Stefle and S.M. Baxter. 
1990. Sequence-dependent oligonucleotide- 
target duplex stabilities: rales from empirical 
studies with a sei of Iwenty-mers. BioTech- 
niques *:674-678. 

l2..Naeve, C.W., G.A. Buck, R.L. Niece, R.T. 
Pon, M. Robertson and A.J. Smith. 1995. 
Accuracy of automated DNA sequencing: a 
multi-laboratory comparison of sequencing re- 
suits. BioTechniques /S':448-453. 

1 3 .PE Biosystems. 1 99 1 . Extension primers: rec- 
ommendations for sequence selection, synthe- 
sis, and purificalioii. /n User Bulletin 19. PE 
liiosy stems, Hosier City, CA. 

M.Pon, H.T., G.A. Buck, K.M. Hager, C.W. 
Naeve, R. L. Niece, M. Robertson and AJ. 
Smith. 1996. Multi-facility survey of oligonu- 
cleotide synthesis and an examination of the 
performance of unpunfied primers in automat- 
ed DNA sequencing. BioTecluiiques 2/:680- 
685. 

15. Pon, R,T., G.A. Buck, R.L. Niece, M. 
Robertson, A.J. Smith and E. Spicer. 1994. 
A survey of nucleic acid services in core labo- 
ratories. BioTechniques / 7:5 26-534. 

16. Ryeh]|k, W. and R.E. Rhoads. 1989. A com- 
puter program for choosing optimal oligonu- 
cleotides for filter hybridizaiion, sequencing 
and in vitro amplification of DNA. Nucleic 
Acids Res. /7:8543.855i. 

1 7 Strauss, E.C, J.A. Koborl, G. Slu and L.E. 
Hood. 1986. Specific-primer-directed DNA se- 
quencing. Anal. Biochem. /J.#:353-360. 

18. VUSS, H., S. Wlemann, D. Grolhues, C. 
Scnsen, J. Ziminermann, C. Schwager, J, 
SteGem«nn, H. Erfle, T. Rupp and W. An- 
sorge. 1993. Automated low-redundancy 
large-scale DNA sequencisig by primer walk- 
ing. BioTechniques /J:7I4-721. 

19. Zakerl, H., G. Amparo, S.-M. Chen, S. 
Spurgeon and P.-Y. Kwok. 1 998. Peak height 
pattern in dichloro-rhcdamine and energy 
transfer dye terminator sequencing. BioTech- 
niques .?J:'106-4M. 



Received 20 January 1999; accepted 
26 May 1999. 



Address correspondence to: 

Dr. Gregory A. Buck 

Depanmcnl of Microbiology and Immunology 
P.O. Box 980678 

Medical College ofVirginio Campus 
Virginia Commonweallh Universiiy 
Richmond. VA 2329S-067S. USA 
hih^rnei: buck^Jisc. vcu.edu 



Vol. 27, No. 3(1999) 



